Bayesian Normalization and Identification for Differential Gene Expression Data
نویسندگان
چکیده
Commonly accepted intensity-dependent normalization in spotted microarray studies takes account of measurement errors in the differential expression ratio but ignores measurement errors in the total intensity, although the definitions imply the same measurement error components are involved in both statistics. Furthermore, identification of differentially expressed genes is usually considered separately following normalization, which is statistically problematic. By incorporating the measurement errors in both total intensities and differential expression ratios, we propose a measurement-error model for intensity-dependent normalization and identification of differentially expressed genes. This model is also flexible enough to incorporate intra-array and inter-array effects. A Bayesian framework is proposed for the analysis of the proposed measurement-error model to avoid the potential risk of using the common two-step procedure. We also propose a Bayesian identification of differentially expressed genes to control the false discovery rate instead of the ad hoc thresholding of the posterior odds ratio. The simulation study and an application to real microarray data demonstrate promising results.
منابع مشابه
Bayesian Differential Analysis of Gene Expression Data
This paper describes a novel Bayesian method for the differential analysis of large scale gene expression data. The novelty of the method is the use of a contamination model that integrates the different sources of variability that affect gene expression data measured with microarray technology, thus removing the need for arbitrary normalization.
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملValidation of Reference Genes for Real Time PCR Normalization in Milk Somatic Cells of Holstein Dairy Cattle
Real time-qPCR is the most reliable method for evaluation of mRNA expression levels. However, to obtain accurate results, selection of suitable reference genes is necessary for normalizing the real-time qPCR data. The aim of this research was to validate the expression stability of three potential reference genes (ACTB, GAPDH and UXT) in milk somatic cells of Holstein dairy cattle under differe...
متن کاملGene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method
Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...
متن کاملSelection of suitable reference genes for real-time PCR studies of early developmental stages of sturgeons
In quantitative real-time PCR, the mRNA level can be quantified in relative terms based on the expression ratio of mRNAs of the target gene and an internal reference gene. Since, an internal standard should be expressed at a constant level among different tissues of an organism at all stages of development, and should be unaffected by the experimental treatment, the stability of different refer...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of computational biology : a journal of computational molecular cell biology
دوره 12 4 شماره
صفحات -
تاریخ انتشار 2005